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ABSTRACT 

Motivated  by  the  recurring  use  of  the  generalized 
Gaussian  family  to  model  different  underwater  noise 
sources  and  the  asymptotic  performance  levels  of  some 
commonly  used  detectors  for  this  family,  we  examined 
the  performance  of  these  detectors  for  several  different 
underwater  noise  sources.  The  sources  considered  are 
non-Gaussian,  highly  correlated,  generally  non¬ 
stationary,  and  vary  from  being  lighter  to  heavier 
tailed  than  Gaussian.  Although  the  linear  detector 
had  the  best  performance,  the  linear  rank  detector  and 
the  L-level  uniform  quantizer  consistently  had  similar 
performance  levels.  Considering  the  simplicity  and  the 
robustness  of  the  quantizer  and  the  rank  detectors,  ei¬ 
ther  of  these  detectors  would  seem  to  be  the  best 
choice  for  these  environments,  j  ; 

1.  INTRODUCTION 

Consider  the  simple  binary  hypothesis  test: 

H0  :  X  =*  U  ffi  X  =  E  +  1  (1) 

Let  X  represent  the  M-vector  of  samples  received  by  the  detector, 
(V  the  noise  samples,  and  4  the  samples  of  the  known,  positive  sig¬ 
nal.  For  the  following  analysis,  let  fa  denote  the  M-variate  den¬ 
sity  of  the  noise  samples  £/.  Define  the  performance  parameters  of 
a  detector  to  be 

a  =  Probability  (  decide  Hx  |  H0  true  )  (2a) 

8  » Probability  (  decide  Hx  |  Hx  true  )  (2b) 

The  parameter  a  is  defined  as  the  level  or  the  false  alarm  probar 
bility,  while  the  parameter  8  i»  the  power  or  the  performance 
level.  A  detector  is  said  to  be  optimal  if  for  any  level  a  <  a,  the 
power  8  is  maximized.  As  this  is  recognized  as  the  Neyman- 
Pearson  (NP1  criterion,  the  resulting  optimal  (parametric)  detec¬ 
tor  will  be  [I] 
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2.  BACKGROUND 

2.1  The  Data 

Four  different  data  sets  were  considered  in  this  analysis.  A 
summary  of  these  environments  is  contained  in  Table  1.  where 
the  entries  on  the  Shrimp,  Merchant,  and  Seismic  data  sets  have 
been  taken  from  [7|.  The  largest  data  set  under  consideration  is 
Arctic  under-ice  noise.  The  data,  FRAM  II,  was  recorded  on 
April  23,  1980  at  11:30  -  11:40  pm.  An  omnidirectional  hydro¬ 
phone  was  suspended  91m  in  4000m  water  and  radio  linked  to  an 
analog  recorder.  Subsequently,  the  data  was  analog  filtered,  0  01 
-  5  kHz  passband,  and  lowpaas  filtered,  2.5kHz  cutoff  frequency. 
Finally,  the  data  was  digitized  with  a  sampling  frequency  of  10 
kHz  [8].  Following  Dwyer  [8]  and  Veitch  and  Wilks  [9|,  the  data 
set  was  segmented  into  6006  records  with  each  record  containing 
1024  samples. 

The  second  set  of  data  was  collected  in  the  Pacific  OceSn  off 
the  coast  of  Hawaii  and  is  the  sound  of  shrimp  snapping  their 
claws  together.  When  the  claws  of  the  shrimp  snap  together  high 
amplitude  bursts  which  last  from  two  to  five  milliseconds  are 
created.  The  Merchant  Shipping  noise  was  recorded  by  a  single 
microphone  which  was  suspended  in  the  Indian  Ocean  near  a 
known  merchant  shipping  lane.  The  final  data.  Seismic  noise, 
was  recorded  on  the  continental  shelf  near  a  region  of  artificially- 
induced  seismic  activity. 
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Noise 

Sampling 
Rate  IkHzl 

Length 

Comments 

Arctic 

10. 

6159144 

non-stationary, 
non-Gaussian,  four 
distinct  modes 

Shrimp 

40. 

95232 

non-stationary, 
non-Gaussian, 
heavy  tails 

Merchant 

1.25 

102400 

stationary, 
light  tails 

Seismic  1 

0.892 

95232 

non-stationary, 
heavy  tails 

Seismic  2 

0.892 

124928 

non-stationary, 
heavy  tails 

where  1  is  the  log  likelihood  ratio,  and 
threshold  required  to  obtain  a  level  a. 

An  alternative  to  (3)  is  to  use  a  non-par ametric  detector.  A 
detector  is  said  to  be  non-parametric  if  its  level  a  can  be  deter¬ 
mined  without  knowledge  of  the  specific  distribution  or  functional 
form  of  the  noise  distribution.  Strictly  speaking,  a  detector  which 
is  non-parametric  for  white  noise  may  fail  to  be  noo-parametric 
for  correlated  noise.  All  underwater  environments  to  be  con¬ 
sidered  are  correlated,  thus  we  will  say  a  detector  is  non- 
parametric  if  the  level  a  can  be  determined  without  knowledge  of 
the  functional  form  of  the  noise  distribution,  provided  the  noise  is 
white  !2-6|. 

It  is  clear  that  a  non-parametric  detection  scheme  is  desir¬ 
able  if  there  exists  little  knowledge  of  the  actual  noise  distribu¬ 
tion.  In  addition,  non-parametric  detectors  are  generally  more 
robust  than  parametric  detectors  and  thus  may  be  desirable  for 
an  environment  which  is  non-stationary.  The  general  structure  of 
the  detectors  to  be  considered  is  a  memoryless  nonlinearity  (ZNL) 
defined  is  1,  Wowed  by  an  accumulator  and  a  threshold  com¬ 
parator. 
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Table  1  -  Summary  of  Underwater  Noise  Environments 


2,2  Previous  Work 

Several  different  researchers  have  investigated  the  data  sets 
under  consideration.  For  the  Arctic  data,  the  analyses  by  Dwyer 
[8, 111  “d  Veitch  and  Wilks  (9)  were  used  as  a  basis  for  this  work. 
Dwyer  concluded  that  the  Arctic  data  was  both  non-stationary 
and  non-Gaussian.  In  addition,  he  proposed  a  technique  which 
could  be  applied  to  the  Arctic  data  to  separate  wide-band  and 
narrow-band  processes  from  each  other. 

In  1985  Veitch  and  Wilks  [9j  analyzed  the  Arctic  data. 
They  agreed  with  Dwyer  that  the  data  was  non-stationary  How¬ 
ever,  their  analysis  led  them  to  conclude  the  data  could  be 
modeled  as  a  mixture  process  which  contained  a  Gaussian  back¬ 
ground  component.  They  also  examined  the  sample  variance 
(with  respect  to  the  global  variance),  the  sample  skew,  and  the 
sample  kurtosis  of  each  record  and  concluded  that  the  records 
could  be  grouped  into  four  distinct  categories. 


Nielsen  and  Thomas  [12,13]  proposed  an  alternative  univari¬ 
ate  model  for  the  data  and  examined  memoryleas  parametric 
detection  using  the  model.  The  correlation  between  samples  was 
analyzed,  and  the  performance  of  some  detectors  which  incor¬ 
porated  the  correlation  of  the  data  samples  were  analyzed  and 
found  to  have  a  superior  performance  levei  to  the  memoryless 
detectors.  In  addition,  an  adaptive  pre-processing  algorithm  was 
implemented  which  resulted  in  a  further  improvement. 

A  primary  investigator  of  the  Shrimp,  Merchant,  and 
Seismic  data  sets  under  consideration  is  Willett  [7|.  For  all  data 
types  he  proposed  univariate  models  and  examined  parametric 
detection.  Bouvet  and  Schwartz  [14]  used  the  same  data  sets  as 
[7],  but  were  concerned  primarily  with  fitting  the  data  to  the 
Gauss-Gauss  mixture  and  to  Middelton’s  Class  A  models.  In 
addition,  Machell,  Penrod,  and  Ellis  [10]  have  recently  claimed 
that  the  generalized  Gaussian  model  is  valid  for  Ambient,  Ship¬ 
ping,  Seismic,  and  Antarctic  ocean  noise. 


2.3  Univariate  Model 

The  generalized  Gaussian  family,  which  was  first  introduced 
by  Kanefsky  in  1965  [3],  has  become  a  popular  model  for  under¬ 
water  noise  sources.  Much  of  the  interest  has  be  due  to  the  abil¬ 
ity  to  model  symmetric  processes  which  are  lighter  or  heavier 
tailed  than  Gaussian.  In  addition,  the  family  includes  the  Gaus¬ 
sian  distribution  and  the  Laplacian  distribution.  As  defined  bv 
(4),  the  family  has  a  parameter  for  the  mean,  n,  the  variance,  <r , 
and  the  tail  weight,  c. 
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For  c  =  2  the  density  is  Gaussian  and  for  c  =«  1  the  density  is 
Laplacian.  In  general,  if  the  shape  parameter  c  <  2  then  the 
density  has  heavier  tails  than  Gaussian,  and  for  c  >  2  the  density 
has  lighter  tails  than  Gaussian.  Since  the  relationship  between 
the  shape  parameter  and  the  kurtoeis  is  invertible  for 
c  £  [1.4,  5],  knowledge  of  the  kurtoeis  allows  determination  of 
the  shape  parameter  c. 


3.  NON-PARAMETRIC  DETECTION 


3.1  Motivation 


As  a  result  of  investigations  on  the  generalized  Gaussian 
family  and  its  suitability  to  underwater  environments,  the  perfor¬ 
mance  levels  of  four  different  detectors  for  the  noise  environments 
described  have  been  examined.  Consider  a  measure  of  perfor¬ 
mance  between  detectors  i  and  j  to  be  the  asymptotic  relative 
efficiency  (ARE)  (2) 


ARE(i,»  = 
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where  is  the  efficacy  of  detector  i  provided  it  is  finite.  If 
ARE(i,;)  >  1  then  detector  i  is  superior  to  detector  j. 

The  linear  detector  is  used  as  a  basis  for  comparison  and  is 
both  optimal  and  locally  optimal  for  Gaussian  noise.  The 
memoryleas  transformation  /  and  the  efficacy  can  be  defined  as 

*(*)-£  Z'  W 
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where  o2  is  the  variance  of  the  noise  process.  A  second  detector, 
which  is  commonly  considered,  is  the  sign  detector.  In  addition  to 
being  the  NP  optimal  (non-parametric)  detector  for  zero  median 
noise  and  the  NP  locally  optimal  test  for  Laplace  noise,  the  sign 
detector  is  robust.  Let  u  be  the  unit  step  function  and  /  be  the 
univariate  density  of  the  noise,  then  the  ZNL  and  the  efficacy  for 
the  sign  test  can  be  defined  as 

'(l) -£«(*<)  «-r.-4/,(0)  (7) 
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The  linear  rank  (Wilcoxon-Mann- Whitney)  detector  is  a 
non-parametric  detector  which  incorporates  information  about 
the  relative  sizes  of  each  sample  by  assigning  a  ranking  r  which 
lies  between  one  and  M  and  is  dependent  on  the  magnitude  of  a 
sample  with  respect  to  other  samples.  The  linear  rank  detector  is 
more  robust  than  the  linear  detector  and  it  uses  more  information 
about  each  sample  in  forming  the  test  statistic  than  does  the  sign 
detector.  The  ZNL  and  the  efficacy  for  this  detector  can  be  writ¬ 
ten  as  [5] 

lU)  “  £r(kl)  «(*.-)  fr.^=12 
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In  addition  to  the  linear,  sign,  and  rank  detectors,  uniform 
and  non-uniform  quantizer  detectors  were  also  examined  for  three 
of  the  four  environments.  Since  the  results  of  the  non-uniform 
quantizer  were  very  similar  to  those  of  the  uniform  quantizer, 
only  the  uniform  quantizer  will  be  considered  henceforth.  Define 
the  quantizer  as  follows 
n 

l(l)  -  E  ?(*.')  <?(*.)  =  lj  lf  */- 1  <  i  <  tj  (9) 

i-l 

where  L  is  the  number  of  levels,  l;  j  =  1,  2 are  the  vertical 

break  points,  and  j  =*  0,  1 . L  are  the  horizontal  break  points 

with  to  ”  — oo  and  ty  »  oo.  For  a  uniform  quantizer  with  a  verti¬ 
cal  spacing  of  Q  <r/5  between  levels,  we  have  that 
lj=  — j(L— 1)/2  — y+l]<3  and  ty«  — [  L/1  —j \ Q  As  with  the  linear 
rank  detector,  the  quantizer  is  more  robust  than  the  linear  detec¬ 
tor  and  retains  more  information  in  the  test  statistic  about  the 
distribution  of  the  samples  than  the  sign  detector.  For  a  constant 
signal  and  a  symmetric,  zero  mean  noise  source,  the  efficacy*  can 
be  written  as; 

(e  /;(/(<,-.)-/(<>))} 
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Where  /  is  the  univariate  density  and  F  is  the  cumulative  distri¬ 
bution  function  of  the  noise.  A  derivation  of  the  efficacy  can  be 
found  in  Kaasam  [2],  It  is  clear  that  if  the  number  of  levels  is 
optimized  for  a  particular  application  the  quantizer  will  always 
perform  as  well  as  the  sign  detector.  Thus  we  will  not  examine 
the  ARE  with  respect  to  the  quantizer  detector.  In  addition,  note 
that  the  linear  detector  and  the  uniform  quantizer  detector  are 
parametric  detectors. 

For  the  generalized  Gaussian  family  of  densities  the  effica¬ 
cies  are  well  defined.  Thus  the  ARE  between  two  detectors  can 
be  computed  and  found  to  be 

ARE( sign,  /in)  -  c2  (11) 

ARE(rani,  sign)  •  3  ■  2-l/,e 
ARE(rank,  tin)  —  ARE(jign,  tin)  ■  ARE(ronk,  sign) 

Figure  1  contains  plots  of  the  ARE  vs  shape  parameter  c  for  dif¬ 
ferent  pairs  of  detectors.  Notice  that  the  sign  detector  is  better 
than  either  the  linear  or  rank  detectors  for  heavy  tailed  noise,  i  e. 
c  <1.4.  When  the  noise  is  lighter  tailed  than  Laplacian  but 
heavier  tailed  than  Gaussian,  the  rank  detector  appears  to  be  the 
best.  Finally,  for  noise  which  is  Gaussian  or  lighter  tailed  than  pi 
Gaussian,  the  linear  detector  is  the  best  while  the  rank  detector  r‘ 
will  have  a  comparable  performance.  J 


3.2  Simulation  Techniques 

A  vector  of  length  2 M  is  taken  from  the  data  and  we  form 
2l°K*i,  *s,  ..,  zur-i)  the  received  vector  under  hypothesis  H0 
and  +  »,  *4  +  *  ,  ....  xd4  +  s)  the  received  vector  under 

hypothesis  Hy  Note  that  the  two  data  vectors  and  X1  f»ch 
have  sampling  rates  which  are  half  of  the  original  sampling  rate 
of  the  data.  Since  each  data  set  examined  was  over  sampled  by  a  __ 
factor  of  at  least  two,  for  no  signal  the  two  data  vectors  are 
identically  distributed  though  statistically  dependent  on  each 
other.  To  compute  a  and  0,  the  functions  /(2£°)  and  l{Xx) 
computed,  compared  to  a  fixed  threshold,  and  scaled  by  the 
number  of  iterations. 
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ARE(rank,  sign) 
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ARE(sign,  tin) 


All  of  the  data  seta  except  the  Merchant  Shipping  noise  can 
be  shown  to  be  non-st a tionary.  However,  only  the  Arctic  data 
can  be  shown  to  be  composed  of  at  least  two  different  rrcdes 
where  each  mode  lasts  a  significant  period  of  time.  Thus,  with 
the  exception  of  the  Arctic  under-ice  noise,  the  block  size  is 
defined  to  be  the  entire  data  set.  In  addition,  the  signal  power  is 
determined  by  the  global  variance  of  the  data  set,  a,  —  a /VM. 
Simulations  were  made  foi  a  variable  signal  power  and  smaller 
groupings  of  records.  However,  all  detectors  had  relative  perfor¬ 
mance  levels  similar  to  those  for  the  fixed  signal  power  case  with 
the  only  significant  difference  being  a  lower  performance  level 
overall.  Since  the  Arctic  under-ice  data  can  be  separated  into  four 
distinct  modes,  records  belonging  to  the  same  mode  were 
grouped  together  and  examined  separately.  The  signal  power  was 
defined  on  a  record-by-record  basis.  If  Sj  is  the  ith  element  of  the 
signal  M- vector,  then  Sj-ery/VM,  where  of  denotes  the  noise 
power  of  the  yth  record.  For  all  simulations  environments  a  sig¬ 
nal  length  of  sixteen  was  used. 

3.3  Results 

In  Figure  2  the  results  of  the  simulations  for  the  Shrimp 
noise  are  shown.  Table  1  indicates  that  the  noise  is  heavy  tailed 
with  respect  to  Gaussian,  yet  the  sign  detector  has  the  worst  per¬ 
formance.  In  particular,  if  the  uniform  quantizer  detector  is  used, 
it  is  found  that  by  using  16  levels  the  uniform  quantizer  performs 
better  than  the  rank  detector  and  nearly  as  well  as  the  linear 
detector.  In  addition,  notice  that  the  performance  level  of  the 
rank  detector  is  similar  to  that  of  the  linear  detector. 


Figure  2.  Performance  Levels  for  Shrimp  Noise 


For  the  Merchant  Shipping  noise,  the  results  were  as 
expected.  The  shipping  noise  is  lighter  tailed  than  Gaussian,  thus 
it  is  expected  that  the  linear  detector  would  have  the  best  perfor¬ 
mance.  Indeed,  as  shown  in  Figure  3,  the  linear  detector  is  best 
though  the  performance  of  the  rank  detector  is  nearly  identical  to 
that  of  the  linear  detector.  If  the  uniform  quantizer  detector  is 
used,  a  small  improvement  over  the  sign  detector  can  be  achieved 
by  use  of  4  levels.  However  if  32  levels  are  used,  the  uniform 
quantizer  and  the  rank  detector  have  performance  levels  which 
are  comparable. 


Figure  3.  Performance  Levels  for  Merchant  Noise 

Both  of  the  Seismic  noise  recordings  have  been  shown  to  be 
heavy  tailed  with  respect  to  Gaussian.  Since  the  results  for  both 
seismic  data  sets  were  similar,  only  the  results  for  the  first  data 
set  will  be  presented.  As  shown  in  Figure  4,  the  sign  detectors 
performance  level,  relative  to  the  levels  of  the  linear  and  rank 
detectors,  has  improved  significantly.  Indeed,  for  some  levels  a. 
the  rank  and  sign  detectors  have  performance  levels  better  than 
the  linear  detectors.  A  uniform  quantizer  detector  with  8  levels 
will  perform  as  well  as  the  rank  detector  for  some  ranges  of  a  and 
will  be  comparable  otherwise. 


Figure  4.  Performance  Levels  for  Seismic  1  Noise 


The  Arctic  data  can  be  grouped  into  four  different  modes. 
The  four  modes  can  be  defined  either  by  the  moments  !8l  or  by 
the  correlation  function  (9).  Both  methods  result  in  essentially  the 
same  groupinp  of  records.  Category  1,  HSQL  is  characterized  by 
records  with  a  High  variance  (with  respect  to  the  global  vari¬ 
ance),  a  SQueezed  or  small  skew,  and  a  Low  (less  than  three)  kur- 
tosis.  Category  2,  HSTN,  is  characterized  by  records  with  a 


STretched  or  large  skew,  and  a  Normal  (about  three)  kurtoeis. 
Category  3,  HSTL,  is  the  same  a  Category  2  except  these  records 
have  a  Low  kurtoeis.  Finally,  Category  4,  NNN,  has  Normal 
variance,  skew,  and  kurtoeis.  For  all  modes,  the  linear  detector 
had  the  best  performance  and  the  sign  detector  had  the  worst 
performance.  Once  again  the  rank  detector  performed  nearly  as 
well  as  the  linear  detector.  Figures  5  and  6  shown  the  perfor¬ 
mance  levels  the  the  linear  rank  detector  for  the  four  different 
modes  and  the  performance  levels  of  the  linear  rank,  linear,  and 
sign  detectors  for  the  Category  NNN. 


Figure  5.  Performance  Levels  for  Linear  Rank 
Detector  for  Arctic  Noise 


Figure  8.  Performance  Levels  for  Arctic  NNN  Noise 


CONCLUSIONS 

The  results  indicate  that,  while  the  linear  detector  almost 
always  has  the  best  performance  level,  the  linear  rank  detector 
has  a  performance  level  which  is  similar,  in  general.  In  addition, 
a  Irlevel  uniform  quantizer,  which  is  simpler  than  the  rank  detec¬ 
tor,  can  have  performance  levels  which  are  comparable  to  those 
of  the  linear  and  the  rank  detectors.  Surprisingly,  even  in  environ¬ 
ments  which  are  heavy  tailed  such  as  the  shrimp  and  seismic 
noise,  the  sign  detector  had  a  relatively  poor  performance.  It  was 
expected  that  the  sign  detector  would  have  a  better  performance 
level  than  either  the  rank  or  linear  detectors,  though  this  was  not 
the  case.  It  is  important  to  emphasize  that  for  complex  and  non¬ 
stationary  environments,  such  as  those  examined,  either  the  rank 
detector  or  the  quantizer  detector  may  be  the  wisest  choice  from 
the  standpoint  of  robustness  and  simplicity.  The  rank  detector  is 
particularly  attractive  since  no  ootimization  need  be  done  to 
determine  the  number  of  levels  or  the  spacing  between  levels. 
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